cDNA FOR HUMAN METHYLENETETRAHYDROFOLATE REDUCTASE

BACKGROUND OF THE INVENTION

(a) Field of the Invention 

The invention relates to a cDNA probe for human
methylenetetrahydrofolate reductase (MTHFR), and its
uses.

(b) Description of Prior Art 

Folic acid derivatives are coenzymes for several
critical single-carbon transfer reactions, including
reactions in the biosynthesis of purines, thymidylate
and methionine. Methylenetetrahydrofolate reductase
(MTHFR; EC 1.5.1.20) catalyzes the NADPH-linked
reduction of 5,10-methylenetetrahydrofolate to
5-methyltetrahydrofolate, a co-substrate for
methylation of homocysteine to methionine. The porcine
liver enzyme, a flavoprotein, has been purified to
homogeneity; it is a homodimer of 77-kDa subunits.
Partial proteolysis of the porcine peptide has revealed
two spatially distinct domains: an N-terminal domain of
40 kDa and a C-terminal domain of 37 kDa. The latter
domain contains the binding site for the allosteric
regulator S-adenosylmethionine.

Hereditary deficiency of MTHFR, an autosomal recessive
disorder, is the most common inborn error of folic acid
metabolism. A block in the production of
methyltetrahydrofolate leads to elevated homocysteine
with low to normal levels of methionine. Patients with
severe deficiencies of MTHFR (0-20% activity in
fibroblasts) can have variable phenotypes.
Developmental delay, mental retardation, motor and gait
abnormalities, peripheral neuropathy, seizures and
psychiatric disturbances have been reported in this
group, although at least one patient with severe MTHFR
deficiency was asymptomatic. Pathologic changes in the
severe form include the vascular changes that have been
found in other conditions with elevated homocysteine,
as well as reduced neurotransmitter and methionine
levels in the CNS. A milder deficiency of MTHFR (35-50%
activity) has been described in patients with coronary
artery disease (see below). Genetic heterogeneity is
likely, considering the diverse clinical features, the
variable levels of enzyme activity, and the
differential heat inactivation profiles of the
reductase in patients' cells.

Coronary artery disease (CAD) accounts for 25% of
deaths of Canadians. Cardiovascular risk factors (male
sex, family history, smoking, hypertension,
dyslipoproteinemia and diabetes) account for
approximately 60 to 70% of our ability to discriminate
CAD patients from healthy subjects. Elevated plasma
homocysteine has also been shown to be an independent
risk factor for cardiovascular disease.

Homocysteine is a sulfhydryl-containing amino acid that
is formed by the demethylation of methionine. It is
normally metabolized to cysteine (transsulfuration) or
re-methylated to methionine. Inborn errors of
metabolism (as in severe MTHFR deficiency) causing
extreme elevations of homocysteine in plasma, with
homocystinuria, are associated with premature vascular
disease and widespread arterial and venous thrombotic
phenomena. Milder elevations of plasma homocysteine (as
in mild MTHFR deficiency) have been associated with the
development of peripheral vascular disease,
cerebrovascular disease and premature CAD.

Homocysteine remethylation to methionine requires the
folic acid intermediate, 5-methyltetrahydrofolate,
which is produced from 5,10-methylenetetrahydrofolate
through the action of 5,10-methylenetetrahydrofolate
reductase (MTHFR). Deficiency of MTHFR results in an
inability to metabolize homocysteine to methionine;
elevated plasma homocysteine and decreased methionine
are the metabolic consequences of the block. Severe
deficiencies of MTHFR (less than 20% of activity of
controls), as described above, are associated with
early-onset neurologic symptoms (mental retardation,
peripheral neuropathy, seizures, etc.) and with
atherosclerotic changes and thromboembolism. Milder
deficiencies of MTHFR (35-50% of activity of controls),
with a thermolabile form of the enzyme, are seen in
patients with cardiovascular disease without obvious
neurologic abnormalities.

In a survey of 212 patients with proven coronary artery
disease, the thermolabile form of MTHFR was found in
17% of the CAD group and 5% of controls. In a
subsequent report on 339 subjects who underwent
coronary angiography, a correlation was found between
thermolabile MTHFR and the degree of coronary artery
stenosis. Again, traditional risk factors (age, sex,
smoking, hypertension, etc.) were not significantly
associated with thermolabile MTHFR. All the studies on
MTHFR were performed by enzymatic assays of MTHFR in
lymphocytes, with measurements of activity before and
after heat treatment to determine thermolability of the
enzyme.

Since 5-methyltetrahydrofolate, the product of the
MTHFR reaction, is the primary form of circulatory
folate, a deficiency in MTHFR might lead to other types
of disorders. For example, periconceptual folate
administration to women reduces the occurrence and
recurrence of neural tube defects in their offspring.
Neural tube defects are a group of developmental
malformations (meningomyelocele, anencephaly,
encephalocele) that arise due to failure of closure of
the neural tube. Elevated levels of plasma homocysteine
have been reported in mothers of children with neural
tube defects. The elevated plasma homocysteine could be
due to a deficiency of MTHFR, as described above for
cardiovascular disease.

Neuroblastomas are tumors derived from neural crest
cells. Many of these tumors have been reported to have
deletions of human chromosome region 1p36, the region
of the genome to which MTHFR has been mapped. It is
possible that MTHFR deletions/mutations are responsible
for or contribute to the formation of this type of
tumor. MTHFR abnormalities may also contribute to the
formation of other types of tumors, such as colorectal
tumors, since high dietary folate has been shown to be
inversely associated with risk of colorectal adenomas.

MTHFR activity is required for homocysteine methylation
to methionine. Methionine is necessary for the
formation of S-adenosylmethionine, the primary methyl
donor for methylation of DNA, proteins, lipids,
neurotransmitters, etc. Abnormalities in MTHFR might
lead to lower levels of methionine and
S-adenosylmethionine, as well as to elevated
homocysteine. Disruption of methylation processes could
result in a wide variety of conditions, such as
neoplasias, developmental anomalies, neurologic
disorders, etc.

Although the MTHFR gene in Escherichia coli (metF) has
been isolated and sequenced, molecular studies of the
enzyme in higher organisms have been limited without
the availability of a eukaryotic cDNA. In this
communication, we report the isolation of a human cDNA
for MTHFR, its chromosomal assignment, and the
identification of mutations in MTHFR-deficient
patients. This report represents the first molecular
description of mutations in MTHFR deficiency.

It would be highly desirable to be provided with a cDNA
probe for human methylenetetrahydrofolate reductase
(MTHFR). This probe would be used for identification of
sequence abnormalities in individuals with severe or
mild MTHFR deficiency, including cardiovascular
patients and patients with neurologic symptoms or
tumors. The probe would also be used in gene therapy,
isolation of the gene, and expression studies to
produce the MTHFR protein. The probe would also provide
the amino acid sequence of the human MTHFR protein,
which would be useful for therapy of MTHFR deficiency
by biochemical or pharmacological approaches.

It would be highly desirable to be provided with a
molecular description of mutations in
methylenetetrahydrofolate reductase deficiency.

SUMMARY OF THE INVENTION

One aim of the present invention is to provide a cDNA
probe for human methylenetetrahydrofolate reductase
(MTHFR).

Another aim of the present invention is to provide a
molecular description of mutations in
methylenetetrahydrofolate reductase deficiency.

Another aim of the present invention is to provide a
nucleic acid and amino acid sequence for human
methylenetetrahydrofolate reductase.

Another aim of the present invention is to provide
potential therapy for individuals with
methylenetetrahydrofolate reductase deficiency.

Another aim of the present invention is to provide a
system for synthesis of MTHFR protein in vitro.

A further aim of the present invention is to provide
for a technology/protocol for identification of
sequence changes in the MTHFR gene.

BRIEF DESCRIPTION OF THE DRAWINGS

Figs. 1A to 1E illustrate the first cDNA coding
sequence for methylenetetrahydrofolate reductase
(MTHFR);

Fig. 2 is the alignment of amino acids for human
methylenetetrahydrofolate reductase (MTHFR), the metF
genes from E. coli (ECOMETF), and S. typhimurium
(STYMETF), and an unidentified open reading frame in
Saccharomyces cerevisiae that is divergently
transcribed from an excision repair gene (ysRAD1);

Figs. 3A and 3B illustrate the sequencing and
restriction enzyme analysis for the Arg to Ter
substitution;

Figs. 4A and 4B illustrate the sequencing and
restriction enzyme analysis for the Arg to Gln
substitution;

Figs. 5A and 5B illustrate the sequence change and
restriction enzyme analysis for the alanine to valine
substitution;

Fig. 6 illustrates the total available sequence of
human MTHFR cDNA;

Figs. 7A and 7B illustrate the expression analysis of
MTHFR cDNA in E. coli, respectively (7A) the Western
blot of bacterial extracts and tissues, and (7B) the
thermolability assay of bacterial extracts;

Figs. 8A to 8D illustrate the identification of a 5'
splice site mutation leading to a 57 bp in-frame
deletion of the cDNA;

Figs. 9A to 9D illustrate the diagnostic restriction
endonuclease analysis of 4 mutations;

Figs. 10A to 10D illustrate the ASO hybridization
analysis of 2 mutations; and

Fig. 11 illustrates the region of homology between
human methylenetetrahydrofolate reductase (MTHFR) and
human dihydrofolate reductase (DHFR).

DETAILED DESCRIPTION OF THE INVENTION

Sequencing of peptides from porcine MTHFR

Homogeneous native porcine MTHFR was digested with
trypsin to generate a 40 kDa N-terminal fragment and a
31 kDa C-terminal fragment; the 31 kDa fragment is a
proteolytic product of the 37 kDa fragment. The
fragments were separated by SDS-PAGE, electroeluted,
and the denatured fragments were digested with lysyl
endopeptidase (LysC). The resulting peptides were
separated by reversed-phase HPLC and subjected to
sequence analysis by Edman degradation (details
contained in Goyette P et al., Nature Genetics, 1994,
2:195-200).

Isolation and sequencing of cDNAs

Two degenerate oligonucleotides were synthesized based
on the sequence of a 30 amino acid porcine MTHFR
peptide (first underlined peptide in Fig. 2). These
were used to generate a 90 bp PCR product, encoding the
predicted peptide, from reverse transcription-PCR
reactions of 500 ng pig liver polyA+ RNA. A
pig-specific (non-degenerate, antisense) PCR primer was
then synthesized from this short cDNA sequence. Using
this primer and a primer for phage arms, a human liver
λgt10 cDNA library (Clontech) was screened by PCR; this
technique involved the generation of phage lysate
stocks (50,000 pfu) which were boiled for 5 min and
then used directly in PCR reactions with these two
primers. PCR fragments were then sequenced directly
(Cycle Sequencing™ kit, GIBCO), and a positive clone
was identified by comparison of the deduced amino acid
sequence to the sequence of the pig peptide (allowing
for inter-species variations). The positive stock was
then replated at lower density and screened with the
radiolabelled positive PCR product by plaque
hybridization until a well-isolated plaque was
identified. Phage DNA was purified and the insert was
then subcloned into pBS+ (Bluescript) and sequenced on
both strands (Cycle Sequencing™ kit, GIBCO and
Sequenase™, Pharmacia). The deduced amino acid sequence
of the human cDNA was aligned to the porcine peptide
sequences, the metF genes from E. coli (ecometf,
accession number V01502) and S. typhimurium (stymetf,
accession number X07689) and with a previously
unidentified open reading frame in Saccharomyces
cerevisiae that is divergently transcribed with respect
to the excision repair gene, ysRAD1 (accession number
K02070). The initial alignments were performed using
BestFit™ in the GCG computer package, and these
alignments were adjusted manually to maximize
homologies.

In summary, degenerate oligonucleotide primers were
designed to amplify a sequence corresponding to a
30-amino acid segment of a porcine peptide from the
N-terminal region of the enzyme (first porcine peptide
in Fig. 2). A 90-bp porcine cDNA fragment was obtained
from reverse transcription/PCR of pig liver RNA.
Sequencing of the PCR fragment confirmed its identity
by comparison of the deduced amino acid sequence to the
porcine peptide sequence. A nondegenerate
oligonucleotide primer, based on the internal sequence
of the porcine cDNA, was used in conjunction with
primers for the phage arms to screen a human liver
λgt10 cDNA library by PCR. The insert of the positive
clone was isolated and sequenced. The sequence
consisted of 1266 bp with one continuous open reading
frame.
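
The link between the 30-amino acid peptide and the 90-bp
product is plain codon arithmetic, and the "degeneracy" of an
oligonucleotide derived from a peptide is the product of the
number of codons available for each residue. The following is
a minimal sketch of both calculations using the standard
genetic code. The peptide MWDHKF is a hypothetical
illustration, not part of the porcine sequence, and the
function names are likewise illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codons_for(aa):
    """Return every codon that encodes the one-letter amino acid `aa`."""
    return [codon for codon, residue in CODON_TABLE.items() if residue == aa]


def degeneracy(peptide):
    """Number of distinct DNA sequences that could encode `peptide`."""
    total = 1
    for aa in peptide:
        total *= len(codons_for(aa))
    return total


def coding_length_bp(n_residues):
    """Length in bp of the coding sequence for `n_residues` amino acids."""
    return 3 * n_residues


# Hypothetical 6-residue stretch chosen for low codon degeneracy.
EXAMPLE_PEPTIDE = "MWDHKF"

print(coding_length_bp(30))         # 30 residues -> 90 bp
print(degeneracy(EXAMPLE_PEPTIDE))  # 1 * 1 * 2 * 2 * 2 * 2 = 16
```

Stretches rich in Met and Trp (one codon each) keep the primer
pool small, whereas Leu, Ser and Arg (six codons each) inflate
it quickly.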

Homology with MTHFR in other species

The deduced amino acid sequence of the human cDNA was
aligned with the metF genes from E. coli and
S. typhimurium, as well as with a previously
unidentified ORF in Saccharomyces cerevisiae that is
divergently transcribed with respect to the excision
repair gene, ysRAD1 (Fig. 2). The sequences homologous
to 5 porcine peptides are underlined in Fig. 2. Three
segments (residues 61-94, 219-240, and 337-351)
correspond to internal peptide sequence from the
N-terminal 40 kDa domain of the porcine liver enzyme.
Residues 374-393 correspond to the upstream portion of
the LysC peptide from the C-terminal domain of the
porcine liver enzyme that is labeled when the enzyme is
irradiated with UV light in the presence of
(3H-methyl)AdoMet; as predicted from the AdoMet
labeling studies, this peptide lies at one end
(N-terminal) of the 37 kDa domain. A fifth region of
homology (residues 359-372) was also identified, but
the localization of the porcine peptide within the
native protein had not been previously determined.

Methylenetetrahydrofolate reductase (MTHFR) is an
enzyme involved in amino acid metabolism that is
critical for maintaining an adequate methionine pool,
as well as for ensuring that the homocysteine
concentration does not reach toxic levels. The high
degree of sequence conservation, from E. coli to Homo
sapiens, attests to the significance of MTHFR in these
species. The enzyme in E. coli (encoded by the metF
locus) is a 33 kDa peptide that binds reduced FAD and
catalyzes the reduction of methylenetetrahydrofolate to
methyltetrahydrofolate. The metF enzyme differs from
the mammalian enzyme in that it cannot be reduced by
NADPH or NADH, and its activity is not allosterically
regulated by S-adenosylmethionine. The native porcine
enzyme is susceptible to tryptic cleavage between the
N-terminal 40 kDa domain and the C-terminal 37 kDa
domain, and this cleavage results in the loss of
allosteric regulation by adenosylmethionine, but does
not result in loss of catalytic activity. Since the
homology between the bacterial and mammalian enzymes is
within the N-terminal domain, this region must contain
the flavin binding site and residues necessary to bind
the folate substrate and catalyze its reduction. The
domain structure of the human enzyme has not been
elucidated, although the human enzyme has been reported
to have a molecular mass of 150 kDa and is likely to be
a homodimer of 77 kDa.

We predict that the point of cleavage between the two
domains lies between residues 351 and 374 of the human
sequence, based on the localization of peptides
obtained from the isolated domains of the porcine
enzyme. This region, containing the highly-charged
sequence KRREED, is predicted to have the highest
hydrophilicity and surface probability of any region in
the deduced human sequence.
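
A sliding-window hydrophilicity scan illustrates why a run
such as KRREED stands out. The text does not say which scale
or window size was used, so the sketch below assumes the
Hopp-Woods scale with a six-residue window. The flanking
residues in the example are hypothetical and are not taken
from the human MTHFR sequence.

```python
# Hopp-Woods hydrophilicity values (assumed scale; not named in the text).
HOPP_WOODS = {
    "R": 3.0, "D": 3.0, "E": 3.0, "K": 3.0, "S": 0.3, "N": 0.2, "Q": 0.2,
    "G": 0.0, "P": 0.0, "T": -0.4, "A": -0.5, "H": -0.5, "C": -1.0,
    "M": -1.3, "V": -1.5, "I": -1.8, "L": -1.8, "Y": -2.3, "F": -2.5,
    "W": -3.4,
}


def window_means(seq, window=6):
    """Mean hydrophilicity of every `window`-residue stretch of `seq`."""
    return [
        (start, sum(HOPP_WOODS[aa] for aa in seq[start:start + window]) / window)
        for start in range(len(seq) - window + 1)
    ]


def most_hydrophilic(seq, window=6):
    """Return (start index, mean score) of the most hydrophilic window."""
    return max(window_means(seq, window), key=lambda item: item[1])


# Hypothetical flanks around the KRREED motif, for illustration only.
EXAMPLE = "AVLGIKRREEDVLAF"

start, score = most_hydrophilic(EXAMPLE)
print(EXAMPLE[start:start + 6], score)
```

All six residues of KRREED carry the maximum value on this
scale, so any window that covers the motif exactly reaches the
highest possible mean.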

The N-terminus of the porcine protein has been
sequenced, and the region encoding this part of the
protein is missing from the human cDNA. We estimate
that this cDNA is missing only a few residues at the
N-terminus, since the predicted molecular mass of the
deduced sequence upstream of the putative cleavage site
(KRREED) is 40 kDa, and the measured molecular mass of
the porcine N-terminal domain is also 40 kDa. When the
bacterial, yeast and human sequences are aligned, the
deduced human sequence contains an N-terminal extension
of 40 amino acids; we suspect that this extension
contains determinants for NADPH binding. Many pyridine
nucleotide-dependent oxidoreductases contain such
determinants at the N-terminus of the protein.

The C-terminus of the human sequence contains a peptide
that is labeled when the protein is irradiated with
ultraviolet light in the presence of tritiated AdoMet.
The cDNA sequence we report here contains only about
1 kDa of the predicted 37 kDa mass of this domain,
indicating that this cDNA is truncated at the 3'
terminus as well. A number of peptides from the
C-terminal porcine domain have also not been detected.
As might be expected, given that the prokaryotic
enzymes do not appear to be allosterically regulated by
AdoMet, there are no significant homologies between the
C-terminal region in this cDNA and the prokaryotic metF
sequences. The alignment shown in Fig. 2 shows that the
homologous sequences terminate just prior to the
putative cleavage site of the human enzyme.

Chromosomal assignment 

In situ hybridization to metaphase human chromosomes
was used for localization of the human gene. The
analysis of the distribution of 200 silver grains
revealed a significant clustering of grains (40 grains)
in the p36.3-36.2 region of chromosome 1 (p<0.0001),
with the majority of these grains (25 grains) observed
over 1p36.3.
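
The grain-clustering result can be read as a one-sided
binomial test: if the 200 grains fell at random, the number
landing in a given region would follow a binomial distribution
whose success probability is that region's share of the
genome. The text does not give that share or name the test
used, so the 2% figure below is a deliberately generous
hypothetical value, and the calculation is only a sketch of
the reasoning.

```python
from math import comb


def binomial_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


TOTAL_GRAINS = 200       # from the text
GRAINS_IN_REGION = 40    # from the text (1p36.3-36.2)
ASSUMED_REGION_FRACTION = 0.02  # hypothetical share of the genome

p_value = binomial_tail(TOTAL_GRAINS, GRAINS_IN_REGION, ASSUMED_REGION_FRACTION)
print(p_value < 0.0001)
```

Under this assumption only about 4 grains would be expected in
the region by chance, so 40 observed grains is far into the
tail of the distribution.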

The isolation of the human cDNA has allowed us to
localize the gene to chromosome 1p36.3. The observation
of one strong signal on that chromosome with little
background is highly suggestive of a single locus with
no pseudogenes. Southern blotting of human DNA revealed
fragments of approximately 10 kb, predicting a gene of
average size, since this cDNA encodes approximately
half of the coding sequence.

Additional cDNA sequences and constructs for expression
analysis

A human colon carcinoma cDNA library (gift of Dr.
Nicole Beauchemin, McGill University) was screened by
plaque hybridization with the original 1.3 kb cDNA to
obtain additional coding sequences. A cDNA of 2.2 kb
was isolated, which contained 1.3 kb of overlapping
sequence to the original cDNA and 900 additional bp at
the 3' end (Fig. 6). The amino acid sequence is
identical to that of the original cDNA for the
overlapping segment (codons 1-415) except for codon 177
(ASP), which was a GLY codon in the original cDNA.
Analysis of 50 control chromosomes revealed an ASP
codon at this position. The cDNA has an open reading
frame of 1980 bp, 100 bp of 3' UTR and a poly A tail.
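
As a rough sanity check, the size of the polypeptide encoded
by a 1980 bp open reading frame can be estimated from the
codon count. The sketch below assumes that the 1980 bp figure
includes the stop codon and uses the common rule of thumb of
about 110 Da per residue. Neither assumption comes from the
text, so the result is an order-of-magnitude estimate only.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb value (assumption)


def residues_from_orf(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of `orf_bp` base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


def approx_mass_kda(n_residues):
    """Approximate polypeptide mass in kDa from the residue count."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


n_residues = residues_from_orf(1980)
print(n_residues, approx_mass_kda(n_residues))
```

The estimate falls in the same range as the 77 kDa subunit
reported for the porcine enzyme, which is as much as an
approximation of this kind can show.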

Sequencing was performed on both strands for the entire
cDNA. Additional 5' sequences (800 bp) were obtained
from a human kidney cDNA library (Clontech) but these
sequences did not contain additional coding sequences
and were therefore used for the PCR-based mutagenesis
only (as described below) and not for the expression
analysis. The two cDNAs (2.2 kb and 800 bp) were
ligated using the EcoRI site at bp 199 and inserted
into the Bluescript™ vector (Stratagene). The 2.2 kb
cDNA was subcloned into the expression vector pTrc99A
(Pharmacia) using the NcoI site at bp 11 and the XbaI
site in the polylinker region of both the Bluescript™
and the pTrc99A vectors. Sequencing was performed
across the cloning sites to verify the wild-type
construct.

UTILITY OF INVENTION IN IDENTIFICATION OF MUTATIONS

I. Identification of first two mutations in severe 
MTHFR deficiency 

Total RNA of skin fibroblasts from MTHFR-deficient
patients was reverse-transcribed and amplified by PCR
for analysis by the single strand conformation
polymorphism (SSCP) method (Orita, et al., Genomics,
1989, 5:874-879). Primers were designed to generate
fragments of 250-300 bp and to cover the available cDNA
sequences with small regions of overlap for each
fragment at both ends. The first mutation identified by
SSCP was a C to T substitution at bp 559 in patient
1554; this substitution converted an arginine codon to
a termination codon (Fig. 3A). Since the mutation
abolished a FokI site, restriction digestion was used
for confirmation of the change and for screening
additional patients for this mutation; a second patient
(1627) was identified in this manner (Fig. 3B). The
SSCP pattern for patient 1554 and the restriction
digestion pattern for both patients were consistent
with a homozygous mutant state or with a genetic
compound consisting of the nonsense mutation with a
second mutation that did not produce any detectable RNA
(null allele). Studies in the parents are required for
confirmation.
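
The codon behind a reported substitution can often be narrowed
down from the genetic code alone. The sketch below lists every
codon in the standard genetic code for which a given
single-base change converts one amino acid into another. For a
C to T change turning arginine into a stop codon, only CGA to
TGA qualifies. The function name and output format are
illustrative and are not taken from the text.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def explaining_codons(ref, alt, aa_from, aa_to):
    """List (codon, mutant codon, 1-based position) for every codon of
    `aa_from` in which changing one `ref` base to `alt` yields `aa_to`."""
    hits = []
    for codon, aa in CODON_TABLE.items():
        if aa != aa_from:
            continue
        for index, base in enumerate(codon):
            if base != ref:
                continue
            mutant = codon[:index] + alt + codon[index + 1:]
            if CODON_TABLE[mutant] == aa_to:
                hits.append((codon, mutant, index + 1))
    return hits


# Arg -> Ter by a C to T change ("*" denotes a stop codon).
print(explaining_codons("C", "T", "R", "*"))
# Arg -> Gln by a G to A change.
print(explaining_codons("G", "A", "R", "Q"))
```

The same lookup shows that an arginine to glutamine change
caused by a G to A transition must affect the second position
of a CGA or CGG codon.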

The second substitution (Fig. 4A) was a G to A
transition at bp 482 in patient 1834 that converted an
arginine into a glutamine residue. The substitution
created a PstI site which was used to verify the
substitution and to identify a second patient (1863)
with this change (Fig. 4B). The SSCP analysis and the
restriction digestion pattern were consistent with a
heterozygous state for both patients. The arginine
codon affected by this change is an
evolutionarily-conserved residue, as shown in Fig. 2.
This observation, in conjunction with the fact that the
codon change is not conservative, makes a strong
argument that the substitution is a pathologic change
rather than a benign polymorphism. Furthermore, 35
controls (of similar ethnic background to that of the
probands) were tested for this substitution by Southern
blotting of PstI-digested DNA; all were negative.
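
Whether a point mutation creates or abolishes a restriction
site can be checked by searching both strands before and after
the change. The sketch below uses the known recognition
sequences of PstI (CTGCAG) and FokI (GGATG). The two short
test sequences are hypothetical and are not the MTHFR
sequences around bp 482 or bp 559.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

PSTI_SITE = "CTGCAG"  # palindromic recognition sequence
FOKI_SITE = "GGATG"   # non-palindromic, so both strands are searched


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def has_site(seq, site):
    """True if `site` occurs on either strand of `seq`."""
    return site in seq or reverse_complement(site) in seq


def site_effect(seq, index, alt, site):
    """Classify the effect on `site` of replacing seq[index] with `alt`."""
    mutant = seq[:index] + alt + seq[index + 1:]
    before, after = has_site(seq, site), has_site(mutant, site)
    if before and not after:
        return "abolished"
    if after and not before:
        return "created"
    return "unchanged"


# Hypothetical sequences for illustration only.
print(site_effect("AACTGCGGTT", 6, "A", PSTI_SITE))  # G -> A creates CTGCAG
print(site_effect("AACATCCGTT", 5, "T", FOKI_SITE))  # C -> T destroys CATCC
```

This check reports only the presence or absence of a site. A
fragment that carries more than one copy of the site would
need its sites counted to predict the digestion pattern.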

The family of patient 1834 was studied. The symptomatic
brother and the mother of the proband were both shown
to carry this substitution, whereas the father was
negative for the change (Fig. 4B). In the family of
1863, the mother of the proband was shown to be a
carrier, while the father and an unaffected brother
were negative.

Cell lines 

Cell line 1554 is from a Hopi male who was admitted at
age three months with homocystinuria, seizures,
dehydration, corneal clouding, hypotonia and Candida
sepsis. Folate distribution in cultured fibroblasts
showed a Pediococcus cerevisiae/Lactobacillus casei
(PC/LC) ratio of 0.52 (control 0.14). There was no
measurable methylenetetrahydrofolate reductase (MTHFR)
activity (control values = 9.7 and 15.1 nmoles/h/mg
protein; residual activity after treatment of control
extracts at 55°C for 20 min = 28% and 31%).

Cell line 1627 is from a Choctaw male who presented
with poor feeding, apnea, failure to thrive,
dehydration and homocystinuria at five weeks of age. He
was subsequently found to have superior sagittal sinus
thrombosis and hydrocephalus. The PC/LC ratio was 0.61
and the specific activity of MTHFR was 0.1 nmoles/h/mg
protein. There is consanguinity in that the maternal
and paternal grandmothers are thought to be "distantly
related".

Cell line 1779 is from a French Canadian male with
homocystinuria who first had limb weakness,
incoordination, paresthesiae, and memory lapses at age
15 years, and was wheelchair-bound in his early
twenties. His brother (cell line 1834) also has
homocystinuria, but is 37 years old and asymptomatic.
Specific activity of MTHFR was 0.7 and 0.9 nmole/h/mg
protein for 1779 and 1834, respectively; the residual
activity after heat treatment at 55°C was 0.9% and 0%
for 1779 and 1834, respectively.

Cell line 1863 is from a white male who was diagnosed
at age 21 years because of a progressive gait
disturbance, spasticity, cerebral white matter
degeneration, and homocystinuria. He had a brother who
died at age 21 years of neurodegenerative disease.
Specific activity of MTHFR in fibroblast extracts was
1.76 nmoles/h/mg protein and the residual enzyme
activity after treatment at 55°C was 3.6%.

Mutation analysis

Primers were designed from the cDNA sequence to
generate 250-300 bp fragments which overlapped 50-75 bp
at each end. The primer pairs were used in reverse
transcription-PCR of 5 ng patient total fibroblast RNA.
The PCR products were analyzed by a non-isotopic rapid
SSCP protocol (PhastSystem™, Pharmacia), which uses
direct silver staining for detection of single strands.
Any PCR products from patients showing a shift on SSCP
gels were purified by NuSieve (FMC Bioproducts) and
sequenced directly (Cycle Sequencing™ kit, GIBCO) to
identify the change. If the change affected a
restriction site, then a PCR product was digested with
the appropriate restriction endonuclease and analyzed
on polyacrylamide gels. To screen for the Arg to Gln
mutation in controls, 5 µg of PstI-digested DNA was run
on 0.8% agarose gels and analyzed by Southern blotting
using the radiolabelled cDNA by standard techniques.
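
The overlapping-fragment layout used for SSCP screening can be
sketched as a simple tiling of the cDNA. The example below
tiles the 1266 bp sequence of the original cDNA with 275 bp
fragments that overlap by 60 bp. Those two values are
illustrative picks from within the stated 250-300 bp and
50-75 bp ranges, not the actual primer positions, which are
not given in this section.

```python
def tile_amplicons(cdna_length, fragment=275, overlap=60):
    """Return 1-based (start, end) coordinates of overlapping fragments
    that together cover a cDNA of `cdna_length` bp."""
    if cdna_length < 1:
        raise ValueError("cdna_length must be positive")
    if not 0 <= overlap < fragment:
        raise ValueError("overlap must be non-negative and below fragment size")
    step = fragment - overlap
    tiles = []
    start = 1
    while True:
        end = min(start + fragment - 1, cdna_length)
        tiles.append((start, end))
        if end == cdna_length:
            break
        start += step
    return tiles


tiles = tile_amplicons(1266)
for start, end in tiles:
    print(start, end, end - start + 1)
```

The final fragment is shorter than the others because it is
cut off at the end of the sequence. In practice the last primer
pair would be shifted upstream to keep the fragment within the
size range that SSCP resolves well.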

II. Seven additional mutations at the
methylenetetrahydrofolate reductase (MTHFR) locus with
genotype:phenotype correlations in severe MTHFR
deficiency

The characterization of 7 additional mutations at this
locus is reported hereinbelow: 6 missense mutations and
a 5' splice site defect which activates a cryptic
splice site in the coding sequence. We also present a
preliminary analysis of the relationship between
genotype and phenotype for all 9 mutations identified
thus far at this locus. A nonsense mutation and 2
missense mutations (proline to leucine and threonine to
methionine) in the homozygous state are associated with
extremely low activity (0-3%) and onset of symptoms
within the first year. Other missense mutations
(arginine to cysteine and arginine to glutamine) are
associated with higher enzyme activity and later onset
of symptoms.

Seven additional mutations at the MTHFR locus are
described and the association between genotype, enzyme
activity, and clinical phenotype in severe MTHFR
deficiency is examined.

Patient description

The clinical and laboratory findings of the patients
have been reported in the published literature.
Residual MTHFR activity was previously measured in
cultured fibroblasts at confluence.

Patient 354, an African-American girl, was diagnosed at
age 13 years with mild mental retardation. Her sister,
patient 355, was diagnosed at age 15 years with
anorexia, tremor, hallucinations and progressive
withdrawal. In patient 354, residual MTHFR activity was
19% and in her sister, 355, it was 14% of control
values. The residual activity after heating had
equivalent thermal stability to control enzyme.

Patient 1807, a Japanese girl whose parents are first
cousins, had delayed walking and speech until age 2
years, seizures at age 6 years and a gait disturbance
with peripheral neuropathy at age 16 years. Residual
activity of MTHFR was 3% and the enzyme was
thermolabile.

Patient 735, an African-Indian girl, was diagnosed at
age 7 months with microcephaly, progressive
deterioration of mental development, apnea and coma.
Residual activity of MTHFR was 2% of control levels.
Thermal properties were not determined.

Patient 1084, a Caucasian male, was diagnosed at age 3
months with an infantile fibrosarcoma. He was found to
be hypotonic and became apneic. He died at the age of 4
months. Residual activity of MTHFR was not detectable.
Thermal properties were not determined.

Patient 356, the first patient reported with MTHFR
deficiency, is an Italian-American male who presented
at age 16 years with muscle weakness, abnormal gait and
flinging movements of the upper extremities. MTHFR
residual activity was 20% of control values; activity
was rapidly and exponentially inactivated at 55°C.

Patient 458, a Caucasian male, was diagnosed at age 12
years with ataxia and marginal school performance.
Residual MTHFR activity was approximately 10%, and the
activity was thermolabile.

Patient 1396, a Caucasian female, was described as
clumsy and as having a global learning disorder in
childhood. At age 14 years, she developed ataxia, foot
drop, and inability to walk. She developed deep vein
thrombosis and bilateral pulmonary emboli. Residual
activity of MTHFR was 14% and the enzyme was
thermolabile.

Genomic structure and intronic primers

Exon nomenclature is based on available cDNA sequence
in Goyette et al. (Nature Genetics, 1994, 2:195-200).
Exon 1 has been arbitrarily designated as the region of
cDNA from bp 1 to the first intron. Identification of
introns was performed by amplification of genomic DNA
using cDNA primer sequences. PCR products that were
greater in size than expected cDNA sizes were sequenced
directly.

Mutation detection

Specific exons (see Table 1 for primer sequences) were
amplified by PCR from genomic DNA and analyzed by the
SSCP protocol. SSCP was performed with the Phastgel™
system (Pharmacia), a non-isotopic rapid SSCP protocol,
as previously described (Goyette P et al., Nature
Genetics, 1994, 2:195-200), or with 35S-labeled PCR
products run on 6% acrylamide: 10% glycerol gels at
room temperature (6 watts, overnight). In some cases,
the use of restriction endonucleases, to cleave the PCR
product before SSCP analysis, enhanced the detection of
band shifts. PCR fragments with altered mobility were
sequenced directly (GIBCO, Cycle Sequencing™ kit). If
the sequence change affected a restriction endonuclease
site, then the PCR product was digested with the
appropriate enzyme and analyzed by PAGE. Otherwise,
allele-specific oligonucleotide (ASO) hybridization was
performed on a dot blot of the PCR-amplified exon.

Table 1

PCR primers for DNA amplification and mutation
analysis of MTHFR

Exon  Primer Type  Primer Sequence (5'->3')   Location  Size (bp)
1     Sense        AGCCTCAACCCCTGCTTGGAGG     C         271
      Antisense    TGACAGTTTGCTCCCCAGGCAC     I
4     Sense        TGAAGGAGAAGGTGTCTGCGGGA    C         196
      Antisense    AGGACGGTGCGGTGAGAGTGG      I
6     Sense        CACTGTTGGTTGGCATGGATGATG   I         392
      Antisense    GGCTGCTCTTGGACCCTCCTC
6     Sense        TGCTTCXX5GCTCCCTCTAGCC     I         251
      Antisense    CCTCCCGCTCCCAAGAACAAAG     I


Table 2

Summary of genotypes, enzyme activity, age at onset,
and background of patients with MTHFR deficiency

Patient(a)  BP Changes(b)   Amino acid           % Activity  Age at Onset            Background

1807        C764T/C764T     Pro->Leu/Pro->Leu    3           within 1st year         Japanese
735         C692T/C692T     Thr->Met/Thr->Met    2           7 months                African Indian
1084        C692T/C692T     Thr->Met/Thr->Met    0           3 months                Caucasian
1554        C559T/C559T     Arg->Ter/Arg->Ter    0           1 month                 Native American (Hopi)
1627        C559T/C559T     Arg->Ter/Arg->Ter    1           1 month                 Native American (Choctaw)
356         C985T/C985T     Arg->Cys/Arg->Cys    20          16 yrs                  Italian American
458         C1015T/G167A    Arg->Cys/Arg->Gln    10          11 yrs                  Caucasian
1396        C1081T/G167A    Arg->Cys/Arg->Gln    14          14 yrs                  Caucasian
1779(c)     G482A/?         Arg->Gln/?           6           15 yrs                  French Canadian
1834(c)     G482A/?         Arg->Gln/?           7           Asymptomatic at 37 yrs  French Canadian
1863        G482A/?         Arg->Gln/?           14          21 yrs                  Caucasian
354(d)      792+1G->A/?     5' splice site/?     19          13 yrs                  African American
355(d)      792+1G->A/?     5' splice site/?     14          11 yrs                  African American

(a) Patients 1554, 1627, 1779, 1834 and 1863 were previously reported by Goyette et al. (1994).
(b) ? = unidentified mutation.
(c) Patients 1779 and 1834 are sibs.
(d) Patients 354 and 355 are sibs.
(1) 5' splice site mutation

Amplification of cDNA, bp 653-939, from reverse-transcribed
total fibroblast RNA revealed 2 bands in sisters 354 and 355:
a smaller PCR fragment (230 bp) in addition to the normal
287 bp allele (Fig. 8A). Fig. 8A is the PAGE analysis of
amplification products of cDNA bp 653-939, from
reverse-transcribed RNA. Controls have the expected 287 bp
fragment while patients 354 and 355 have an additional
230 bp fragment. Sequencing of the smaller fragment
identified a 57 bp in-frame deletion which would remove 19
amino acids (Fig. 8B). Fig. 8B is the direct sequencing of
the PCR products from patient 354. The 57 bp deletion spans
bp 736-792 of the cDNA. An almost perfect 5' splice site
(boxed) is seen at the 5' deletion breakpoint. Analysis of
the sequence at the 5' deletion breakpoint in the undeleted
fragment revealed an almost perfect 5' splice site consensus
sequence (AG/gcatgc). This observation suggested the
presence of a splicing mutation in the natural 5' splice
site that might activate this cryptic site, to generate the
deleted allele. The sequence following the deletion
breakpoint, in the mutant allele, corresponded exactly to
the sequence of the next exon. Amplification of genomic DNA,
using the same amplification primers as those used for
reverse-transcribed RNA, generated a 1.2 kb PCR product
indicating the presence of an intron. Direct sequencing of
this PCR fragment in patient 354 identified a heterozygous
G->A substitution in the conserved GT dinucleotide of the
intron at the 5' splice site (Fig. 8C). Fig. 8C is the
sequencing of the 5' splice site in control and patient 354.
The patient carries a heterozygous G->A substitution in the
5' splice site (boxed). Intronic sequences are in lower
case. This substitution abolished a HphI restriction
endonuclease site which was used to confirm the mutation in
the 2 sisters (Fig. 8D). Fig. 8D is the HphI restriction
endonuclease analysis on PCR products of DNA for exon 4 of
patients 354 and 355, and of 3 controls (C). The 198 bp PCR
product has 2 HphI sites. The products of digestion for the
control allele are 151, 24 and 23 bp. The products of
digestion for the mutant allele are 175 and 23 bp due to the
loss of a HphI site. The fragments of 24 and 23 bp have been
run off the gel.
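
The fragment sizes quoted for the HphI digest follow from simple
arithmetic on the positions of the cut sites. The sketch below is
only an illustration: the cut coordinates (151 and 175) are
hypothetical values chosen so that the control digest reproduces
the reported 151, 24 and 23 bp fragments; the actual order of the
fragments along the 198 bp product is not stated in the text.

```python
def digest(length, cut_positions):
    """Return fragment sizes (bp) of a linear product cut after each position."""
    cuts = sorted(set(cut_positions))
    if any(c <= 0 or c >= length for c in cuts):
        raise ValueError("cut positions must fall strictly inside the product")
    bounds = [0] + cuts + [length]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Assumed (hypothetical) cut coordinates within the 198 bp exon 4 product.
CONTROL_SITES = [151, 175]
MUTANT_SITES = [175]  # the site assumed at 151 is lost on the mutant allele

control = digest(198, CONTROL_SITES)
mutant = digest(198, MUTANT_SITES)
print(control)  # [151, 24, 23]
print(mutant)   # [175, 23]
```

Losing one site merges two neighbouring fragments (151 + 24 = 175),
which is why the mutant allele gives a single larger band.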

(2) Patients with homozygous coding substitutions 

SSCP analysis of exon 4 for patient 1807 revealed an
abnormally-migrating fragment, which was directly sequenced
to reveal a homozygous C->T substitution (bp 764) converting
a proline to a leucine residue. This change creates a MnlI
restriction endonuclease site, which was used to confirm the
homozygous state of the mutation (Fig. 9A). Fig. 9A is the
MnlI restriction analysis of exon 4 PCR products for patient
1807 and 3 controls (C). Expected fragments: control allele,
90, 46, 44, 18 bp; mutant allele, 73, 46, 44, 18, 17 bp. An
additional band at the bottom of the gel is the primer.
Fifty independent control Caucasian chromosomes and 12
control Japanese chromosomes were tested by restriction
analysis; all were negative for this mutation. Homozygosity
in this patient is probably due to the consanguinity of the
parents.

Patients 735 and 1084 had the same mutation in exon 4, in a
homozygous state: a C->T substitution (bp 692) which
converted an evolutionarily-conserved threonine residue to a
methionine residue, and abolished a NlaIII restriction
endonuclease site. Allele-specific oligonucleotide
hybridization to amplified exon 4 (Figs. 10A and 10B) was
used to confirm the mutation in these 2 patients and to
screen 60 independent chromosomes, all of which turned out
to be negative. Fig. 10A is the hybridization of mutant
oligonucleotide (692T) to exon 4 PCR products from patients
735, 1084 and 30 controls. Only DNA from patients 735 and
1084 hybridized to this probe. Fig. 10B is the hybridization
of normal oligonucleotide (692C) to stripped dot blot from
A. All control DNAs hybridized to this probe.

Patient 356 showed a shift on SSCP analysis of exon 5.
Direct sequencing revealed a homozygous C->T substitution
(bp 985) which converted an evolutionarily-conserved
arginine residue to cysteine; the substitution abolished an
AciI restriction endonuclease site. This was used to confirm
the homozygous state of the mutation in patient 356
(Fig. 9B) and its presence in the heterozygous state in both
parents. Fifty independent control chromosomes, tested in
the same manner, were negative for this mutation. Fig. 9B is
the AciI restriction analysis of exon 5 PCR products for
patient 356, his father (F), his mother (M), and 3 controls
(C). Expected fragments: control allele, 129, 105, 90,
68 bp; mutant allele, 195, 129, 68 bp.
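
The way a digest pattern distinguishes a homozygote from a
heterozygous parent can be sketched as set logic: a heterozygote
shows the union of the control and mutant fragment sets. The
function name and the exact-match comparison below are assumptions
made for illustration; reading a real gel would need a size
tolerance and would not be this clean.

```python
# Expected AciI fragments (bp) for the exon 5 product, as listed in the text.
CONTROL_BANDS = {129, 105, 90, 68}
MUTANT_BANDS = {195, 129, 68}


def call_genotype(observed, control=CONTROL_BANDS, mutant=MUTANT_BANDS):
    """Classify an idealized band pattern against the expected fragment sets."""
    observed = set(observed)
    if observed == control:
        return "control/control"
    if observed == mutant:
        return "mutant/mutant"
    if observed == control | mutant:
        return "control/mutant"
    return "uninterpretable"


print(call_genotype([195, 129, 68]))           # mutant/mutant
print(call_genotype([195, 129, 105, 90, 68]))  # control/mutant
print(call_genotype([129, 105, 90, 68]))       # control/control
```

The 195 bp band is the sum of the 105 and 90 bp control fragments,
so its presence together with the 105 and 90 bp bands marks a
heterozygote.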

(3) Patients who are genetic compounds 

Patient 458 is a compound heterozygote of a mutation in
exon 5 and a mutation in exon 1. The exon 5 substitution
(C->T at bp 1015) resulted in the substitution of a cysteine
residue for an arginine residue; this abolished a HhaI
restriction endonuclease site, which was used to confirm the
mutation in patient 458 (Fig. 9C) and to show that 50
control chromosomes were negative. Fig. 9C is the HhaI
restriction analysis of exon 5 PCR products for patient 458
and 4 controls (C). Expected fragments: control allele, 317
and 75 bp; mutant allele, 392 bp. The 75 bp fragment is not
shown in Fig. 9C. The second mutation was a heterozygous
G->A substitution (bp 167) converting an arginine to a
glutamine residue. Allele-specific oligonucleotide
hybridization to amplified exon 1 confirmed the heterozygous
state of this mutation in patient 458 and identified a
second patient (1396) carrying this mutation also in the
heterozygous state (Figs. 10C and 10D). Fig. 10C is the
hybridization of mutant oligonucleotide (167A) to exon 1 PCR
products from patients 458, 1396 and 31 controls. Fig. 10D
is the hybridization of normal oligonucleotide (167G) to
stripped dot blot from C. None of the 62 control chromosomes
carried this mutation.

The second mutation in patient 1396 was identified in
exon 6: a heterozygous C->T substitution (bp 1081) that
converted an arginine residue to a cysteine residue, and
abolished a HhaI restriction endonuclease site. Restriction
analysis confirmed the heterozygous substitution in 1396
(Fig. 9D) and showed that 50 control chromosomes were
negative. Fig. 9D is the HhaI restriction analysis of exon 6
PCR products for patient 1396 and 2 controls (C). Expected
fragments: control allele, 152, 86, 13 bp; mutant allele,
165, 86 bp. The 13 bp fragment has been run off the gel.

(4) Additional sequence changes

HhaI analysis of exon 6, mentioned above, revealed a DNA
polymorphism. This change is a T->C substitution at bp 1068
which does not alter the amino acid (serine), but creates a
HhaI recognition site. T at bp 1068 was found in 9% of
tested chromosomes. Sequence analysis identified 2
discrepancies with the published cDNA sequence: a G->A
substitution at bp 542 which converts the glycine to an
aspartate codon, and a C->T change at bp 1032 which does not
alter the amino acid (threonine). Since all DNAs tested
(>50 chromosomes) carried the A at bp 542 and the T at bp
1032, it is likely that the sequence of the original cDNA
contained some cloning artifacts.

Genotype:phenotype correlation

Table 2 summarizes the current status of mutations in severe
MTHFR deficiency. In 8 patients, both mutations have been
identified; in 5 patients (3 families), only 1 mutation has
been identified. Overall, the correlation between the
genotype, enzyme activity, and phenotype is quite
consistent. Five patients, with onset of symptoms within the
first year of life, had ≤ 3% of control activity. Three of
these patients had missense mutations in the homozygous
state: two patients with the threonine to methionine
substitution (C692T) and one patient with the proline to
leucine substitution (C764T). The nonsense mutation (C559T)
in the homozygous state in patients 1554 and 1627
(previously reported in Goyette P et al., Nature Genetics,
1994, 7:195-200) is also associated with a neonatal severe
form, as expected.

The other patients in Table 2 had ≥ 6% of control activity
and onset of symptoms within or after the 2nd decade of
life; the only exception is patient 1834, as previously
reported (Goyette P et al., Nature Genetics, 1994,
7:195-200). The three patients (356, 458 and 1396) with
missense mutations (G167A, C985T, C1015T and C1081T) are
similar to those previously reported (patients 1779, 1834
and 1863) who had an arginine to glutamine substitution and
a second unidentified mutation (Goyette P et al., Nature
Genetics, 1994, 7:195-200). The sisters with the 5' splice
mutation and an unidentified second mutation also had levels
of activity in the same range and onset of symptoms in the
second decade, but the activity is likely due to the second
unidentified allele.

Discussion 

The patients come from diverse ethnic backgrounds. Although
patients 1554 and 1627 are both Native Americans, the
mutations occur on different haplotypes, suggesting
recurrent mutation rather than identity by descent. Since
the substitution occurs in a CpG dinucleotide, a "hot spot"
for mutation, recurrent mutation is a reasonable hypothesis.
It is difficult to assess whether some mutations are
population-specific since the numbers are too small at the
present time.

MTHFR deficiency is the most common inborn error of folate
metabolism, and a major cause of hereditary homocysteinemia.
The recent isolation of a cDNA for MTHFR has permitted
mutational analysis at this locus, with the aims of defining
important domains for the enzyme and of correlating genotype
with phenotype in MTHFR-deficient patients.

Our definition of a disease-causing substitution, as
distinct from a benign polymorphism, is based on 3 factors:
(1) absence of the change in at least 50 independent control
chromosomes; (2) presence of the amino acid in the bacterial
enzyme, attesting to its evolutionary significance; and (3)
whether the change in amino acid is conservative. Although
expression of the substitutions is required to formally
prove that they are not benign, the criteria above allow us
to postulate that the changes described in this report are
likely to affect activity.

The 7 mutations described here (6 single amino acid
substitutions and a 5' splice site mutation) bring the total
to 9 mutations identified thus far in severe MTHFR
deficiency and complete the mutation analysis for 8
patients. The identification of each mutation in only one or
two families points to the striking degree of genetic
heterogeneity at this locus. Seven of the 9 mutations are
located in CpG dinucleotides, which are prone to mutational
events.
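
Whether a given substitution falls in a CpG dinucleotide can be
checked from its sequence context: a C->T change at the C of a CG
pair, or the same event read from the opposite strand as a G->A
change at the G of a CG pair. The sketch below is illustrative
only; the function name and the flanking sequence are hypothetical
and are not taken from the MTHFR cDNA.

```python
def is_cpg_transition(seq, index, alt):
    """True if changing seq[index] to alt is a C->T or G->A change in a CpG."""
    seq = seq.upper()
    ref, alt = seq[index], alt.upper()
    if ref == "C" and alt == "T":
        # C must be followed by G on the same strand.
        return index + 1 < len(seq) and seq[index + 1] == "G"
    if ref == "G" and alt == "A":
        # G must be preceded by C (the C->T event is on the other strand).
        return index > 0 and seq[index - 1] == "C"
    return False


context = "ATCGGA"  # hypothetical flanking sequence, 0-based indexing
print(is_cpg_transition(context, 2, "T"))  # True: the C of the CG pair
print(is_cpg_transition(context, 3, "A"))  # True: the G of the CG pair
print(is_cpg_transition(context, 4, "A"))  # False: G preceded by G
```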

5' splice site mutation

The G->A substitution at the GT dinucleotide of the 5' splice
site in patients 354 and 355 results in a 57 bp in-frame
deletion of the coding sequence, which should delete 19
amino acids of the protein. This deletion occurs as a result
of the activation of a cryptic 5' splice site (AG/gc) even
though this cryptic site does not have a perfect 5' splice
site consensus sequence (AG/gt). However, GC (instead of GT)
as the first 2 nucleotides of an intron has been reported in
several naturally-occurring splice sites, such as in the
genes for human prothrombin and human adenine
phosphoribosyltransferase and twice within the gene for the
largest subunit of mouse RNA polymerase II. The remaining
nucleotides of the cryptic site conform to a normal splice
site consensus sequence with its expected variations
(A50 G79/g100t100a59a71g82t50). It is unlikely that the
deleted enzyme resulting from this aberrantly-spliced mRNA
would have any activity; 8 of the 19 deleted amino acids are
conserved in the E. coli enzyme. Although the 2 patients
show residual enzyme activity in the range of 20% of
controls, the activity is probably due to the unidentified
second allele in these patients.
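
The statement that the deletion is in-frame and removes 19 amino
acids can be checked from the coordinates reported earlier in the
document (deletion of cDNA bp 736-792, normal RT-PCR product of
287 bp). The helper below is a minimal sketch with a hypothetical
name; it only does the arithmetic and says nothing about the
function of the resulting protein.

```python
def deletion_summary(first_bp, last_bp, normal_product_bp):
    """Summarize a deletion spanning first_bp..last_bp (inclusive)."""
    deleted = last_bp - first_bp + 1
    in_frame = deleted % 3 == 0  # a multiple of 3 keeps the downstream frame
    return {
        "deleted_bp": deleted,
        "in_frame": in_frame,
        "codons_lost": deleted // 3 if in_frame else None,
        "deleted_product_bp": normal_product_bp - deleted,
    }


summary = deletion_summary(736, 792, 287)
print(summary)
# {'deleted_bp': 57, 'in_frame': True, 'codons_lost': 19, 'deleted_product_bp': 230}
```

The 230 bp figure matches the smaller band seen alongside the
normal 287 bp product in the two sisters.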

6 missense mutations 

The Arg->Cys substitution (C1081T) in patient 1396 is within
a hydrophilic sequence previously postulated to be the
linker region between the catalytic and regulatory domains
of MTHFR (Goyette P et al., Nature Genetics, 1994,
7:195-200). These 2 domains are readily separable by mild
trypsinization of the porcine enzyme. The linker domain, a
highly-charged region, is likely to be located on the
outside surface of the protein and therefore more accessible
to proteolysis. Because the Arg->Cys substitution converts a
charged hydrophilic residue to an uncharged polar residue,
it cannot be considered a conservative change, and could
affect the stability of the enzyme.

The 2 Arg->Cys substitutions identified in patients 356 and
458 (C985T and C1015T, respectively) may be involved in
binding the FAD cofactor. Previous work in the literature
showed that heating fibroblast extracts at 55°C, in the
absence of the FAD cofactor, inactivated MTHFR completely.
The addition of FAD to the reaction mixture before heat
inactivation restored some enzyme activity to control
extracts and to extracts from some patients, while the
extracts of patients 356 and 458 were unaffected. Based on
these observations, it was suggested that these 2 patients
had mutations affecting a region of the protein involved in
binding FAD. The 2 mutations are found in close proximity to
each other, within 11 amino acids. In patient 356, the Arg
residue is evolutionarily-conserved in E. coli and is found
in a stretch of 9 conserved amino acids, suggesting a
critical role for this residue; the altered Arg residue in
patient 458 is not evolutionarily-conserved. Crystal
structure analysis of medium chain acyl-CoA dehydrogenase
(MCAD), a flavoprotein, has defined critical residues
involved in the binding of FAD. Two consecutive residues of
the MCAD protein, Met165 and Trp166, involved in
interactions with FAD, can also be identified in MTHFR, 3
and 4 amino acids downstream, respectively, from the Arg
residue altered in patient 458.

The Thr->Met substitution (C692T) is found in a region of
high conservation with the E. coli enzyme and in a region of
good homology with human dihydrofolate reductase (DHFR)
(Fig. 11). In Fig. 11, = is identity; • is homology; and O
is identity to bovine DHFR enzyme. An asterisk (*) indicates
location of Thr->Met substitution. Considering the
early-onset phenotype of the patients, one can assume that
the threonine residue is critical for activity or that it
contributes to an important domain of the protein. This
region of homology in DHFR contains a residue, Thr136, which
has been reported to be involved in folate binding. This Thr
residue in DHFR aligns with a Ser residue in MTHFR, an amino
acid with similar biochemical properties. The Thr->Met
substitution is located 8 amino acids downstream from this
Ser codon, in the center of the region of homology between
the 2 enzymes. We therefore hypothesize that the Thr->Met
substitution may alter the binding of the folate substrate.

The G167A (Arg->Gln) and C764T (Pro->Leu) substitutions both
affect non-conserved amino acids. Their importance in the
development of MTHFR deficiency cannot be determined at the
present time. All the mutations identified thus far are
located in the 5' end of the coding sequence, the region
thought to encode the catalytic domain of MTHFR. Mutation
analysis has been useful in beginning to address the
structure:function properties of the enzyme as well as to
understand the diverse phenotypes in severe MTHFR
deficiency.

III. Identification of A->V mutation

SSCP analysis and direct sequencing of PCR fragments were
used to identify a C to T substitution at bp 677, which
converts an alanine residue to a valine residue (Fig. 5A).
The primers for analysis of the A->V change are:
5'-TGAAGGAGAA GGTGTCTGCG GGA-3' (exonic) and
5'-AGGACGGTGC GGTGAGAGTG-3' (intronic); these primers
generate a fragment of 198 bp. Fig. 5A depicts the sequence
of two individuals, a homozygote for the alanine residue and
a homozygote for the valine residue. The antisense strands
are depicted. This alteration creates a HinfI site
(Fig. 5B), which was used to screen 114 unselected French
Canadian chromosomes; the allele frequency of the
substitution was .38. The substitution creates a HinfI
recognition sequence which digests the 198 bp fragment into
a 175 bp and a 23 bp fragment; the latter fragment has been
run off the gel. Fig. 5B depicts the three possible
genotypes. The frequencies of the 3 genotypes were as
follows: -/-, 37%; +/-, 51%; and +/+, 12% (the (+) indicates
the presence of the HinfI restriction site and a valine
residue).
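
The reported allele frequency can be recovered from the genotype
frequencies, since each +/+ individual carries two copies of the
substitution and each heterozygote one. The sketch below also
computes Hardy-Weinberg expectations for comparison; that
comparison is an added illustration, not an analysis described in
the text, and the function names are hypothetical.

```python
def allele_frequency(freq_het, freq_hom_mut):
    """Frequency of the (+) allele from genotype frequencies."""
    return freq_hom_mut + freq_het / 2


def hardy_weinberg(q):
    """Expected genotype frequencies for (+) allele frequency q."""
    p = 1 - q
    return {"-/-": p * p, "+/-": 2 * p * q, "+/+": q * q}


observed = {"-/-": 0.37, "+/-": 0.51, "+/+": 0.12}
q = allele_frequency(observed["+/-"], observed["+/+"])
expected = hardy_weinberg(q)

print(round(q, 3))  # 0.375, in line with the reported .38
for genotype in ("-/-", "+/-", "+/+"):
    print(genotype, observed[genotype], round(expected[genotype], 3))
```

With 114 chromosomes (57 individuals), differences of a few
percentage points between observed and expected genotype
frequencies correspond to only a handful of people.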

The alanine residue is conserved in porcine MTHFR, as well
as in the corresponding metF and stymetF genes of E. coli
and S. typhimurium, respectively. The strong degree of
conservation of this residue, and its location in a region
of high homology with the bacterial enzymes, alluded to its
importance in enzyme structure or function. Furthermore, the
frequency of the (+/+) genotype was consistent with the
frequency of the thermolabile MTHFR variant implicated in
vascular disease.

Clinical material

To determine the frequency of the A->V mutation, DNA from 57
individuals from Quebec was analyzed by PCR and restriction
digestion. The individuals, who were all French Canadian,
were not examined clinically or biochemically.

The 40 individuals analyzed in Table 3 had been previously
described in Engbersen et al. (Am. J. Hum. Genet., 1995,
56:142-150). Of the 13 cardiovascular patients, 8 had
cerebrovascular arteriosclerosis and 5 had peripheral
arteriosclerosis. Five had thermolabile MTHFR while 8 had
thermostable MTHFR (greater than 33% residual activity after
heating). Controls and patients were all Dutch-Caucasian,
between 20-60 years of age. None of these individuals used
vitamins which could alter homocysteine levels. Enzyme
assays and homocysteine determinations were also reported by
Engbersen et al. (Am. J. Hum. Genet., 1995, 56:142-150).
Table 3

Correlation between MTHFR genotype and enzyme activity,
thermolability and plasma homocysteine level

                                -/-            +/-            +/+
                                n=19           n=9            n=12

specific activity(a,b)          22.9 ± 1.7     15.0 ± 0.8     6.9 ± 0.6
(nmol CH2O/mg protein/hr)       (11.8-33.8)    (10.2-18.8)    (2.6-10.2)

residual activity after         66.8 ± 1.5     56.2 ± 2.8     21.8 ± 2.8
heating(a,b) (%)                (55-76)        (41-67)        (10-36)

plasma homocysteine(a,c)        12.6 ± 1.1     13.8 ± 1.0     22.4 ± 2.9
(µM) (after fasting)            (7-21)         (9.6-20)       (9.6-42)

plasma homocysteine(a,c)        41.3 ± 5.0(d)  41 ± 2.8       72.6 ± 11.7(e)
(µM) (post-methionine load)     (20.9-110)     (29.1-54)      (24.4-159)

(a) one-way anova p<.01
(b) paired t test for all combinations p<.01
(c) paired t test p<.05 for +/+ group versus +/- group or -/- group; p>.05 for +/- versus -/- group.
(d) n=18 for this parameter
(e) n=11 for this parameter

Enzyme activity and plasma homocysteine were determined as
previously reported. Each value represents mean ± standard
error. The range is given in parentheses below the mean.

Correlation of A->V mutation with altered MTHFR function

A genotypic analysis was performed and enzyme activity and
thermolability were measured in a total of 40 lymphocyte
pellets from patients with premature vascular disease and
controls. Thirteen vascular patients were selected from a
previous study (Engbersen et al., Am. J. Hum. Genet., 1995,
56:142-150), among which 5 were considered to have
thermolabile MTHFR. From a large reference group of 89
controls, all 7 individuals who had thermolabile MTHFR were
studied, and an additional 20 controls with normal MTHFR
were selected from the same reference group. Table 3
documents the relationship between genotypes and specific
enzyme activity, thermolability and plasma homocysteine
level.

wo 9503054

PCT/CA95m0314

The mean MTHFR activity for individuals homozygous for the
substitution (+/+) was approximately 30% of the mean
activity for (-/-) individuals, homozygous for the alanine
residue. Heterozygotes had a mean MTHFR activity that was
65% of the activity of (-/-) individuals; this value is
intermediate between the values for (-/-) and (+/+)
individuals. The ranges of activities showed some overlap
for the heterozygous and (-/-) genotypes, but homozygous
(+/+) individuals showed virtually no overlap with the
former groups. A one-way analysis of variance yielded a p
value <.0001; a pairwise Bonferroni t test showed that all
three genotypes were significantly different with p<0.01 for
the three possible combinations.
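
The relative activities quoted in this paragraph can be reproduced
from the mean specific activities listed in Table 3 (22.9, 15.0
and 6.9 nmol CH2O/mg protein/hr). The sketch below is just that
division; the dictionary and function names are hypothetical.

```python
# Mean specific activities from Table 3 (nmol CH2O/mg protein/hr).
MEAN_SPECIFIC_ACTIVITY = {"-/-": 22.9, "+/-": 15.0, "+/+": 6.9}


def relative_to_reference(means, reference="-/-"):
    """Express each genotype's mean as a percentage of the reference mean."""
    ref = means[reference]
    return {genotype: 100 * value / ref for genotype, value in means.items()}


relative = relative_to_reference(MEAN_SPECIFIC_ACTIVITY)
for genotype, percent in relative.items():
    print(f"{genotype}: {percent:.1f}% of (-/-) mean")
```

The computed values (about 30% for +/+ and about 65.5% for +/-)
agree with the approximate figures given in the text.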

The three genotypes were all significantly different (p<.01)
with respect to enzyme thermolability. The mean residual
activity after heat inactivation for 5 minutes at 46°C was
67%, 56% and 22% for the (-/-), (+/-) and (+/+) genotypes,
respectively. While the degree of thermolability overlaps
somewhat for (-/-) individuals and heterozygotes,
individuals with two mutant alleles had a distinctly lower
range. Every individual with the (+/+) genotype had residual
activity <35% after heating, and specific activity <50% of
that of the (-/-) genotype.

Total homocysteine concentrations, after fasting and 6 hours
after methionine loading, were measured in plasma by high
performance liquid chromatography using fluorescence
detection. Fasting homocysteine levels in (+/+) individuals
were almost twice the value for (+/-) and (-/-) individuals.
The differences among genotypes for plasma homocysteine were
maintained when homocysteine was measured following 6 hours
of methionine loading. A one-way anova yielded a p < .01 for
the fasting and post-methionine homocysteine levels. A
pairwise Bonferroni t test showed that only homozygous
mutant individuals had significantly elevated homocysteine
levels (p<.05).

PCR-based mutagenesis for expression of A->V mutation
in vitro

PCR-based mutagenesis, using the cDNA-containing Bluescript™
vector as template, was used to create the A to V mutation.
Vent™ polymerase (NEB) was used to reduce PCR errors. The
following primers were used: primer 1, bp -200 to -178,
sense; primer 2, bp 667 to 687, antisense, containing a
mismatch, A, at bp 677; primer 3, bp 667 to 687, sense,
containing a mismatch, T, at bp 677; primer 4, bp 1092 to
1114, antisense. PCR was performed using primers 1 and 2 to
generate a product of 887 bp, and using primers 3 and 4 to
generate a product of 447 bp. The two PCR fragments were
isolated from a 1.2% agarose gel by Geneclean™ (BIO 101). A
final PCR reaction, using primers 1 and 4 and the first 2
PCR fragments as template, was performed to generate a
1.3 kb band containing the mutation. The 1.3 kb fragment was
digested with NcoI and MscI, and inserted into the wild-type
cDNA-containing expression vector by replacing the sequences
between the NcoI site at bp 11 and the MscI site at bp 943.
The entire replacement fragment and the cloning sites were
sequenced to verify that no additional changes were
introduced by PCR.

Expression analysis of wild-type and mutagenized cDNA

Overnight cultures of JM105™ containing vector alone,
vector + wild-type MTHFR cDNA, or vector + mutagenized cDNA
were grown at 37°C in 2 x YT media with .05 mg/ml
ampicillin. Fresh 10 ml cultures of each were inoculated
with approximately 50 µl of overnight cultures for a
starting O.D. of 0.05, and were grown at 37°C to an O.D. of
1 at 420 nm. Cultures were then induced for 2 hrs. with 1 mM
IPTG and pelleted. The cells were resuspended in TE buffer
with 2 µg/ml aprotinin and 2 µg/ml leupeptin (3-5 x wet
weight of cells). Cell suspensions were sonicated on ice for
3 x 15 sec. to break open cell membranes and then
centrifuged for 30 mins. at 4°C to pellet cell debris and
unlysed cells. The supernatant was removed and assayed for
protein concentration with the Bio-Rad protein assay.
Western analysis was performed using the Amersham ECL™ kit
according to the instructions of the supplier, using
antiserum generated against purified porcine liver MTHFR.
Enzymatic assays were performed by established procedures;
thermolability was assessed by pre-treating the extracts at
46°C for 5 mins. before determining activity. Specific
activities (nmol formaldehyde/hr./mg. protein) were
calculated for the 2 cDNA-containing constructs after
subtraction of the values obtained with vector alone (to
subtract background E. coli MTHFR activity).
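
The background correction described above can be sketched as a
subtraction followed by a ratio. Everything numeric below is
hypothetical, chosen only to make the arithmetic visible, and the
assumption that the vector-alone value is subtracted from both the
heated and unheated measurements before taking the ratio is ours;
the text does not spell out that detail.

```python
def corrected_activity(construct, vector_alone):
    """Specific activity of a construct after removing the vector-only background."""
    return construct - vector_alone


def residual_activity_percent(heated, unheated):
    """Activity remaining after heat treatment, as % of the unheated value."""
    if unheated <= 0:
        raise ValueError("unheated activity must be positive")
    return 100 * heated / unheated


# Hypothetical measurements (nmol formaldehyde/hr/mg protein).
vector_unheated, vector_heated = 1.0, 1.0
construct_unheated, construct_heated = 5.0, 3.0

unheated = corrected_activity(construct_unheated, vector_unheated)
heated = corrected_activity(construct_heated, vector_heated)
print(unheated, heated)                             # 4.0 2.0
print(residual_activity_percent(heated, unheated))  # 50.0
```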

The MTHFR cDNA (2.2 kb) (Fig. 6) has an open reading frame
of 1980 bp, predicting a protein of 74.6 kDa. The purified
porcine liver enzyme has been shown to have subunits of
77 kDa. Western analysis (Fig. 7A) of several human tissues
and of porcine liver has revealed a polypeptide of 77 kDa in
all the studied tissues, as well as an additional
polypeptide of approximately 70 kDa in human fetal liver and
in porcine liver, suggesting the presence of isozymes. Two
µg of bacterial extract protein was used for lanes 1-3. The
tissues (lanes 4-8) were prepared by homogenization in
.25 M sucrose with protease inhibitors (2 µg/ml each of
aprotinin and leupeptin), followed by sonication (3 x 15
sec.) on ice. The extracts were spun for 15 min. in a
microcentrifuge at 14,000 g and 100 µg of supernatant
protein was used for Western analysis. h=human; p=porcine.
The wild-type cDNA and a mutagenized cDNA, containing the
A->V substitution, were expressed in E. coli to yield a
protein of approximately 70 kDa (Fig. 7A), which co-migrates
with the smaller polypeptide mentioned above. Treatment of
extracts at 46°C for 5 minutes revealed that the enzyme
containing the substitution was significantly more
thermolabile than the wild-type enzyme (p<.001; Fig. 7B).
Two separate experiments (with 3-4 replicates for each
construct for each experiment) were performed to measure
thermostable activity of the wild-type MTHFR and mutagenized
MTHFR A->V cDNAs. The values shown represent mean ± standard
error for each experiment, as % of residual activity after
heating. The means of the specific activities before heating
(expressed as nmol formaldehyde/hr./mg. protein) were as
follows: Exp. 1, 3.8 and 5.3 for MTHFR and MTHFR A->V,
respectively; Exp. 2, 6.2 and 7.5 for MTHFR and MTHFR A->V,
respectively. The expression experiments were not designed
to measure differences in specific activity before heating,
since variation in efficiencies of expression could
contribute to difficulties in interpretation. Curiously
though, the specific activity for the mutant construct was
higher in both experiments. It is possible that the mutant
protein has increased stability in E. coli, or that
inclusion bodies in our extracts contributed to differences
in recovery of properly-assembled enzyme.

These studies have identified a common substitution in the
MTHFR gene which results in thermolability in vitro and in
vivo. The mutation, in the heterozygous or homozygous state,
correlates with reduced enzyme activity and increased
thermolability of MTHFR in lymphocyte extracts. A
significant elevation in plasma homocysteine was observed in
individuals who were homozygous for the mutation.
Statistically-significant differences for homocysteine
levels were not observed between heterozygotes and (-/-)
individuals; this observation is not surprising, since
plasma homocysteine can be influenced by several
environmental factors, including intake of folate, vitamin
B12, vitamin B6, and methionine, as well as by genetic
variation at other loci, such as the cystathionine-β-synthase
gene.

The alanine to valine substitution conserves the
hydrophobicity of the residue and is associated with small
changes in activity, in contrast to non-conservative
changes, such as the previously-reported arginine to
glutamine change in MTHFR, which is associated with a
greater decrease in enzyme activity and severe
hyperhomocysteinemia. The alanine residue is situated in a
region of homology with the bacterial metF genes. We have
also observed the same region of homology in the human
dihydrofolate reductase (DHFR) gene (Fig. 11), although the
alanine residue itself is not conserved; this region of
amino acids 130-149 of DHFR contains T136 which has been
implicated in folate binding in an analysis of the crystal
structure of recombinant human DHFR. It is tempting to
speculate that this region in MTHFR is also involved in
folate binding and that the enzyme may be stabilized in the
presence of folate. This hypothesis is compatible with the
well-documented influence of folate on homocysteine levels
and with the reported correction of mild
hyperhomocysteinemia by folic acid in individuals with
premature vascular disease, and in individuals with
thermolabile MTHFR.

Although our cDNA is not long enough to encode the
larger MTHFR polypeptide, it is capable of directing
synthesis of the smaller isozyme. The ATG start codon
for this polypeptide is within a good consensus
sequence for translation initiation. Whether the
isozyme is restricted to liver and what its role is in
this tissue remain to be determined.
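
The text does not say which consensus was used to judge the start codon. As a minimal sketch, assuming the widely cited Kozak rule (a purine at position -3 and a G at position +4, counting the A of ATG as +1), the check can be written as follows. The helper name `start_context` is hypothetical, and the 21-nucleotide input is the opening of the cDNA as read from the poorly reproduced sequence figure, so it should be treated as illustrative only.

```python
# Hypothetical helper: inspects the two flanking positions that the
# Kozak rule (an assumption here, not named in the source) treats as
# most important for translation initiation.

PURINES = {"A", "G"}


def start_context(seq: str, atg_index: int) -> dict:
    """Report the -3 and +4 bases around the ATG at atg_index (0-based)."""
    seq = seq.upper()
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    if atg_index < 3 or atg_index + 3 >= len(seq):
        raise ValueError("not enough flanking sequence")
    minus3 = seq[atg_index - 3]  # third base upstream of the A of ATG
    plus4 = seq[atg_index + 3]   # base immediately after the G of ATG
    return {
        "minus3": minus3,
        "plus4": plus4,
        "purine_at_minus3": minus3 in PURINES,
        "g_at_plus4": plus4 == "G",
    }


# Illustrative input only: first 21 nt as read from the sequence figure.
context = start_context("AATTCCGGAGCCATGGTGAAC", atg_index=12)
print(context)
```

With this input both positions match the assumed rule (G at -3, G at +4), which is consistent with the statement that the ATG lies in a good initiation context.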

These data have identified a common genetic change in
MTHFR which results in thermolability; our experiments
do not directly address the relationship between this
change and vascular disease. Nonetheless, this
polymorphism represents a diagnostic test for
evaluation of MTHFR thermolability in
hyperhomocysteinemia. Large case-control studies are
required to evaluate the frequency of this genetic
change in various forms of occlusive arterial disease
and to examine the interaction between this genetic
marker and dietary factors. Well-defined populations
need to be examined, since the limited data set thus
far suggests that population-specific allele
frequencies may exist. More importantly, however, the
identification of a candidate genetic risk factor for
vascular disease, which may be influenced by nutrient
intake, represents a critical step in the design of
appropriate therapies for the homocysteinemic form of
arteriosclerosis.

cDNA FOR MTHFR AND ITS POTENTIAL UTILITY

A human cDNA for MTHFR (2.2 kb) has been isolated, as
reported by us in Goyette et al. (Nature Genetics,
1994, 7:195-200) and Frosst et al. (Nature Genetics,
1995, 10:111-113). The cDNA has been expressed in vitro
to yield a MTHFR protein of approximately 70 kDa
(Frosst P et al., Nature Genetics, 1995, 10:111-113).

Using the cDNA sequence, mutations were identified in
patients with severe and mild MTHFR deficiency
(Goyette P et al., Nature Genetics, 1994, 7:195-200;
Goyette P et al., Am. J. Hum. Genet., 1995,
56:1052-1059; Frosst P et al., Nature Genetics, 1995,
10:111-113).

The cDNA sequence is a necessary starting point for
the detection of MTHFR sequence changes that would
identify individuals at risk for cardiovascular and
neurological diseases, as well as other disorders
affected by folic acid metabolism. Diagnostic tests by
DNA analysis are more efficient and accurate than
testing by enzymatic/biochemical assays. Less blood is
required and results are available in a shorter period
of time. The tests could be performed as a routine
operation in any laboratory that performs molecular
genetic diagnosis, without the specialized
reagents/expertise that are required for an
enzyme-based test.

The second major utility of the cDNA would be in the
design of therapeutic protocols for correction of MTHFR
deficiency. These protocols could directly involve the
gene, as in gene therapy trials or in the use of
reagents that could modify gene expression.
Alternatively, the therapy might require knowledge of
the amino acid sequence (derived from the cDNA
sequence), as in the use of reagents that would modify
enzyme activity. The identification of sequences and/or
sequence changes in specific regions of the cDNA or
protein, such as FAD-binding sites or folate-binding
sites, is useful in designing therapeutic protocols
involving the above nutrients.

UTILITY OF INVENTION IN CLINICAL AND DIAGNOSTIC
STUDIES

Coronary artery disease patients in Montreal (n=153)
were studied to examine the frequency of the alanine to
valine substitution. Fourteen percent of the patients
were homozygous for this mutation. An analysis of 70
control individuals (free of cardiovascular disease)
demonstrated that only seven percent of these
individuals were homozygous for the alanine to valine
mutation.
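
The comparison of homozygote frequencies in patients and controls can be made concrete with a crude odds ratio. The following is a minimal sketch, not a result from the study: the counts are back-calculated from the rounded percentages quoted above, so they are approximations, and no confidence interval or significance test is attempted. The function names are hypothetical.

```python
def homozygote_counts(n: int, percent: float) -> tuple[int, int]:
    """Approximate (homozygous, other) counts from a group size and a
    rounded percentage. The result is an estimate, not reported data."""
    homozygous = round(n * percent / 100)
    return homozygous, n - homozygous


def odds_ratio(case_pos: int, case_neg: int, ctrl_pos: int, ctrl_neg: int) -> float:
    """Crude odds ratio from a 2x2 table."""
    return (case_pos * ctrl_neg) / (case_neg * ctrl_pos)


patients = homozygote_counts(153, 14)  # about 21 homozygous, 132 other
controls = homozygote_counts(70, 7)    # about 5 homozygous, 65 other
crude_or = odds_ratio(*patients, *controls)
print(patients, controls, round(crude_or, 2))
```

Under these assumptions the crude odds ratio is roughly 2, in line with the doubling of homozygote frequency described in the text; the exact value depends on the true, unrounded counts.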

Analysis of homocysteine levels in 123 men of the
above patient group indicated that the mutant allele
significantly raised homocysteine levels from 10.2
micromoles/L in homozygous normal men to 11.5 and 12.7
micromoles/L in heterozygotes and homozygous mutants,
respectively.

Families with a child with spina bifida, a neural tube
defect, have been examined for the presence of the
alanine to valine mutation. Approximately 16% of
mothers who had a child with spina bifida were
homozygous for this mutation, while only 5% of control
individuals were homozygous. Fathers of children with
spina bifida also had an increased prevalence of the
homozygous mutant genotype (10%), as did the affected
children themselves (13%).

Table 4 indicates the interactive effect of folic acid
with the homozygous mutant alanine to valine change. In
a study of families from Framingham, Massachusetts, and
Utah, individuals who were homozygous mutant but had
folate levels above 5 ng/ml did not have increased
homocysteine levels compared to individuals with the
normal or heterozygous genotype. However, individuals
who were homozygous mutant but had folate levels below
5 ng/ml had homocysteine levels that were significantly
higher than those of the other genotypes.

Table 4

Mean fasting and PML homocysteine levels for different
MTHFR genotypes

Plasma homocysteine
[column headings illegible]

N                        58       61       30
[row label illegible]    9.2      12.1     0.02    [one entry missing]
Folate <5 ng/mL          10.[?]   10.4     15.2    0.002
Folate ≥5 ng/mL          8.2      7.5      7.5     0.[?]2
Post-methionine load     30.0     30.9     31.3    0.62

*Significant interaction
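
The genotype-by-folate interaction described for Table 4 amounts to a two-way stratification. The sketch below shows one way to express that rule in Python. The genotype labels, class name and function names are assumptions made for illustration; the text gives only "above" and "below" 5 ng/ml, so treating exactly 5 ng/ml as adequate is also an assumption.

```python
from dataclasses import dataclass

FOLATE_CUTOFF_NG_PER_ML = 5.0  # threshold quoted in the text


@dataclass
class Subject:
    genotype: str         # "normal", "heterozygous" or "homozygous_mutant" (assumed labels)
    plasma_folate: float  # ng/ml


def stratum(subject: Subject) -> str:
    """Assign a subject to a genotype/folate stratum."""
    low = subject.plasma_folate < FOLATE_CUTOFF_NG_PER_ML
    return f"{subject.genotype}/{'low_folate' if low else 'adequate_folate'}"


def elevated_homocysteine_expected(subject: Subject) -> bool:
    """True only for the stratum the text reports as significantly higher:
    homozygous mutant with plasma folate below the cutoff."""
    return (
        subject.genotype == "homozygous_mutant"
        and subject.plasma_folate < FOLATE_CUTOFF_NG_PER_ML
    )


example = Subject(genotype="homozygous_mutant", plasma_folate=3.2)
print(stratum(example), elevated_homocysteine_expected(example))
```

The rule encodes only the qualitative pattern reported in the text; it is not a predictive model and says nothing about the size of the effect.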

Example III provides preliminary data for therapeutic
intervention by folic acid supplementation to
individuals who are homozygous for the alanine to
valine change. The data suggest that higher levels of
plasma folate would lead to normalization of
homocysteine levels in mutant individuals and might
prevent the occurrence of disorders associated with
high homocysteine levels, such as cardiovascular
disease, neural tube defects, and possibly other
disorders. Folic acid supplementation for mutant
individuals might also restore methionine and
S-adenosylmethionine levels to normal. This would be
relevant for disorders that are influenced by
methylation, such as neoplasias, developmental
anomalies, neurologic disease, etc.

While the invention has been described in connection
with specific embodiments thereof, it will be
understood that it is capable of further modifications
and this application is intended to cover any
variations, uses, or adaptations of the invention
following, in general, the principles of the invention
and including such departures from the present
disclosure as come within known or customary practice
within the art to which the invention pertains and as
may be applied to the essential features hereinbefore
set forth, and as follows in the scope of the appended
claims.

WE CLAIM:

1. A cDNA probe for human methylenetetrahydrofolate
reductase (MTHFR).

2. Use of the probe of claim 1 for the identification
of sequence abnormalities in patients with severe or
mild MTHFR deficiency.

3. The use of claim 2, wherein said deficiency is
selected from the group consisting of cardiovascular
and neurological disorders and disorders influenced by
folic acid metabolism.

4. Use of the probe of claim 1 for gene therapy to
produce the MTHFR protein.

5. A human MTHFR protein which is encoded by the probe
of claim 1.

6. Use of the probe of claim 1 and the protein of
claim 4 for therapy of MTHFR-deficiency patients by
dietary, biochemical or pharmacological approaches.
[Figure: nucleotide sequence of the human MTHFR cDNA,
numbered from 1 to approximately 2220, shown as both
strands with the deduced amino acid sequence beneath
each codon. The reading frame ends at a TGA stop codon
after ...Glu Ala Pro, and the sequence ends in a
poly(A) tract. The sequence itself is not legible in
this copy.]
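
The sequence figure above pairs each codon of the upper strand with its deduced amino acid and shows the complementary strand beneath. A minimal sketch of that codon-by-codon reading, using the standard genetic code, is given below. The 51-nucleotide input is the opening of the cDNA as read from the poorly reproduced figure, so the literal sequence is illustrative only; the function and variable names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Base-by-base complement, written in the same left-to-right order
    as the lower strand in the figure (not reversed)."""
    return seq.upper().translate(COMPLEMENT)


def translate(seq: str, frame: int = 0) -> str:
    """Translate complete codons of seq starting at the given frame offset."""
    seq = seq.upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Illustrative input only: first 17 codons as read from the figure.
FIRST_51_NT = (
    "AAT TCC GGA GCC ATG GTG AAC GAA GCC AGA GGA AAC AGC AGC CTC AAC CCC"
).replace(" ", "")

protein = translate(FIRST_51_NT)
lower_strand = complement(FIRST_51_NT)
print(protein)           # NSGAMVNEARGNSSLNP
print(lower_strand[:6])  # TTAAGG
```

The one-letter output corresponds to Asn Ser Gly Ala Met Val Asn Glu Ala Arg Gly Asn Ser Ser Leu Asn Pro, which agrees with the three-letter translation printed under the first row of the figure.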
[Figure: DNA sequencing gel panels with lanes G, A, T
and C, labelled ARG and GLN.]
[Figure: nucleotide sequence of the MTHFR cDNA with
the deduced amino acid sequence, beginning Met Val Asn
Glu Ala Arg, with nucleotide and residue numbers at the
right margin. The sequence itself is not legible in
this copy.]
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